Automatic Classification of Structured Product Labels for Pregnancy Risk Drug Categories, a Machine Learning Approach
Abstract
Using regular expressions and manual review, we processed 18,342 FDA-approved drug product labels to determine whether any of the five standard pregnancy drug risk categories was mentioned in the label. After excluding 81 drugs with multiple risk categories, 83% of the labels contained a risk category in the text and 17% did not. We trained a Sequential Minimal Optimization (SMO) algorithm on the labels containing pregnancy risk information, segmented into standard document sections. To evaluate the classifier on the test set, we used the Micromedex drug risk categories. The precautions section performed best for assigning drug risk categories, achieving accuracy 0.79, precision 0.66, recall 0.64, and F1 measure 0.65. Missing pregnancy risk categories could be suggested by machine learning algorithms trained on existing, publicly available pregnancy risk information.
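The regular-expression screening step described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' actual expressions, which the abstract does not give. The pattern, the function names `find_risk_categories` and `triage`, and the sample sentences are all assumptions made for the example; only the five category letters (A, B, C, D, X) come from the standard FDA scheme.

```python
import re

# Hypothetical pattern (an assumption, not taken from the paper): matches
# phrases such as "Pregnancy Category C" or "Pregnancy: Category X".
CATEGORY_PATTERN = re.compile(
    r"pregnancy\s*[:.\-]?\s*(?:risk\s+)?category\s*[:\-]?\s*([ABCDX])\b",
    re.IGNORECASE,
)


def find_risk_categories(label_text):
    """Return the sorted, de-duplicated category letters found in a label."""
    return sorted({m.group(1).upper() for m in CATEGORY_PATTERN.finditer(label_text)})


def triage(label_text):
    """Classify a label as 'missing', 'multiple', or a single category letter."""
    categories = find_risk_categories(label_text)
    if not categories:
        return "missing"   # no category in the text; candidate for prediction
    if len(categories) > 1:
        return "multiple"  # the study excluded drugs with several categories
    return categories[0]


# Made-up label fragments, used only to exercise the sketch.
single = "Teratogenic Effects: Pregnancy Category B. Reproduction studies ..."
several = "Pregnancy Category C (first trimester) and Pregnancy Category D (later)."
absent = "Safe use in pregnancy has not been established."

print(triage(single), triage(several), triage(absent))
```

In practice a pattern like this would only produce candidates. The abstract notes that manual review was used alongside the regular expressions.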
Similar resources
Multi-label Classification of Product Reviews Using Structured Svm
Most text classification problems are associated with multiple class labels, and hence automatic text classification is one of the most challenging and prominent research areas. Text classification is the problem of categorizing text documents into different classes. In the multi-label classification scenario, each document may have more than one label. The real challenge in ...
Text Mining and Classification of Product Reviews Using Structured Support Vector Machine
Text mining and text classification are two prominent and challenging tasks in the field of machine learning. Text mining refers to the process of deriving high-quality, relevant information from text, while text classification deals with the categorization of text documents into different classes. The real challenge in these areas is to address problems like handling large text corp...
A Supervised Machine Learning Framework for the Extraction of Drug-Drug Interactions from Structured Product Labels
Background: Information about drug-drug interactions (DDIs) is found in the medical literature and in drug package inserts published on DailyMed in addition to commercial drug databases. Objectives: To develop a machine learning framework for the extraction of DDIs from structured product labels (SPLs). Methods: We develop a supervised machine learning framework (support vector machine classifi...
Automatic road crack detection and classification using image processing techniques, machine learning and integrated models in urban areas: A novel image binarization technique
The quality of the road pavement has always been one of the major concerns for governments around the world. Cracks in the asphalt are one of the most common road tensions that generally threaten the safety of roads and highways. In recent years, automated inspection methods such as image and video processing have been considered due to the high cost and error of manual metho...
Preventing adverse drug events by extracting information from drug fact sheets
Background: The increasing volume and growing complexity of drugs lead to an increased risk of prescription errors and adverse events. A correct drug choice must be modulated to acknowledge both patients’ status and drug-specific information. This information is reported in free-text on drug fact sheets. It is often overwhelming and difficult to access. There is thus a rising need for generatin...
Journal title:
- AMIA ... Annual Symposium proceedings. AMIA Symposium
Volume: 2015
Publication date: 2015